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Abstract — We investigate the problem of distributed sensors' 
failure detection in networks with a small number of defective 
sensors, whose measurements differ significantly from neigh- 
boring sensor measurements. Defective sensors are represented 
by non-zero values in binary sparse signals. We build on the 
sparse nature of the binary sensor failure signals and propose 
a new distributed detection algorithm based on Group Testing 
(GT). The distributed GT algorithm estimates the set of de- 
fective sensors from a small number of linearly independent 
binary messages exchanged by the sensors. The distributed 
GT algorithm uses a low complexity distance decoder that 
is robust to noisy messages. We first consider networks with 
only one defective sensor and determine the minimal number 
of linearly independent messages needed for detection of the 
defective sensor with high probability. We then extend our 
study to the detection of multiple defective sensors by modifying 
appropriately the message exchange protocol and the decoding 
procedure. We show through experimentation that, for small 
and medium sized networks, the number of messages required 
for successful detection is actually smaller than the minimal 
number computed in the analysis. Simulations demonstrate that 
the proposed method outperforms methods based on random 
walk measurements collection in terms of detection performance 
and convergence rate. Finally, the proposed method is resilient 
to network dynamics due to the effective gossip-based message 
dissemination protocol. 



L Introduction 

Over the past years we have witnessed the emergence of 
simple and low cost sensors. This has led to wide deploy- 
ment of sensor networks for monitoring signals in numerous 
applications, for example in medical applications or natural 
hazard detection. However, sensor networks have often a 
dynamic architecture with loose coordination due to the cost 
of communications. This raises new demands for collaborative 
data processing algorithms that are effective under network 
topology and communication constraints. In general, a sensor 
network is represented as a connected graph Q ~ (V, £), 
where vertices V = {si}f^i stand for the S sensors and edges 
£ determine sensors' connectivity. For instance, if two sensors 
Si and Sj lie within each other's communication range, the 
edge e^.j e £ has a nonzero value. Fig. [T] illustrates a setup 
where sensors capture a smooth physical phenomenon (e.g., 
spatial temperature evolution) and generate messages that are 
eventually gathered for analysis. When a sensor is defective, its 
measurements are inaccurate and can contaminate the signal 
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Fig. 1. Ad-hoc sensor network measuring a smooth physical phenomenon. 



analysis. It thus becomes important to detect the defective 
sensors in the network, so that their erroneous values do 
not impact the accuracy of the underlying data processing 
applications. 

The detection literature can be mostly classified into central- 
ized and distributed methods. Most of the works on detection 
methods for binary sparse signals mainly deal with centralized 
systems. The pioneering work in fj] targets medical applica- 
tions. It proposes a simple idea of pooling blood samples and 
observing the viral presence in a set, instead of performing 
tests on every single blood sample separately. Typically, the 
main target is to minimize the number of tests required to 
identify all the infected samples, while keeping the detection 
procedure as simple as possible. This paradigm is known as 
Group Testing (GT) and has been proposed half a century 
ago. GT has been studied more recently in the context of 
sensor networks for detection of malicious events IJ). The 
detection approaches differ in scenarios with errors, inhibitors 
or combinations of them and the detection algorithms are 
rather naive |i3j. Defective sensors are detected by iterative 
elimination of identified non-defective sensors from the test 
outcomes. The detection time is typically of order 0{SB), 
where B is the number of tests and S is the total number of 
sensors. Particular test design methods improve the effective 
time for detection in centralized systems. For example, a useful 
test matrix property called A'-disjunctness property (i.e., the 
Boolean sum of every K columns does not result in any other 
column), speeds up the decoding process. This property is 
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used in code designs, for e.g., for superimposed codes 0, 
||5| . Further, a random efficient detection is proposed in f6\ 
with a decoding time of 0{7r{B) ■ B log^ B + 0{B'^)), where 
B = 0{K^ log S) is the number of tests and tt denotes a 
polynomial. In our knowledge, this represents the state-of-the- 
art decoding performance. 

In sensor networks, test design is contingent to the commu- 
nication limitations. Works that consider constraints imposed 
by the sensor network topology in GT framework are not 
numerous. The authors in Q propose to form tests by a 
random walk process on well-connected graphs. The minimal 
number of tests required for detection in this case depends on 
the random walk mixing time. A bipartite graph structure is 
considered in fSI with a two-stage hybrid detection method. 
Here, a subset of defective items in the first stage is determined 
by pre-designed tests, while the remaining items are tested 
individually in the next step. Data retrieval for topology- 
adaptive GT is studied in tQ] where a binary tree splitting 
algorithm is proposed. The above methods use centralized 
decision algorithms which are not appropriate for large-scale 
sensor networks or networks with a dynamic topology because 
of the high communication costs. In those scenarios one rather 
needs to use distributed detection methods. To the best of 
our knowledge, however, no analysis on distributed detection 
methods which consider sparse and binary test signals are 
available. The distributed methods are rather employed for 
non-sparse signal detection with explicit network and mes- 
sage constraints. Such methods generally employ statistical 
decoders ifTol . For example, a Bayesian approach in ifTTI 
proposes to compute a detection score for a priori defined 
sets of hypothesis, which depends on the received messages. 
The hypothesis with the highest score drives the decision. 
The binary event detection problem for hierarchically clustered 
networks is proposed in lfT2l where the cluster decisions are 
fused to make a final decision. Surveys on similar methods 
can be found in lfT3l. lfT4l. 

In this paper, we propose a novel distributed sensors' failure 
detection method that employs a simple distance decoder 
for sparse and binary signals. We assume that at most K 
sensors are defective out of S sensors in the network, where 
K <^ S. Therefore, the defective sensor identification problem 
boils down to a sparse binary signal recovery, where nonzero 
signal values correspond to defective sensors. Our approach 
is based on GT methods that are commonly applied for 
centralized systems. The core idea is to perform low-cost 
experiments in the network, called tests, in order to detect 
the defective sensors. The tests are performed on pools of 
sensors by a set of sensors called master sensors. The master 
sensors request sensor measurements from their neighbors. 
Each sensor responds to this request with probability q. Due 
to the smoothness of the measured function, non erroneous 
neighbor sensors typically have similar measurements. Each 
master sensor compares the sensor measurements based on a 
similarity measure (e.g., thresholding) to detect the presence 
of defective sensors in its vicinity. The result of this test takes 
a binary value, which might be possibly altered by noise. The 
tests and their outputs together form the network messages that 
are communicated to neighborhood of the master nodes. The 



messages in the sensors are then disseminated in the network 
with a gossip algorithm (rumor mongering ) IBl that follows 
a pull protocol |[T6l . ITtI . ITSl . Each time a new message 
reaches the sensor, its value is linearly combined with the 
message available at the current sensor in order to increase the 
diversity of information in the network. The message design 
and dissemination phases are repeated for several rounds. Due 
to the probabilistic test design and message dissemination we 
employ a simple distance decoder (e.g., Hamming decoder) 
that is able to detect defective sensors, as long as the number of 
messages is sufficient. We analyze the detection failure bounds 
and analytically derive the conditions needed for successful 
failure detection in the case of a single defective sensor. Then, 
we provide the error bounds for detection of multiple defective 
sensors. We show that the number of linearly independent 
messages required for detection is smaller in practice than the 
theoretical bounds obtained in our worst case analysis. We 
finally provide simulation results in regular and irregular net- 
works. The experiments outline the advantages of the proposed 
detection method compared to other binary signal detection 
algorithms based on the random walk measurements gathering. 
Our algorithm outperforms random walk detection methods 
both in terms of the detection accuracy and convergence rate. 

This paper is organized as follows. Section reviews the 
centralized Group Testing framework. Section |III] proposes a 
novel distributed detection method. It describes the message 
formation and dissemination processes in sensor networks 
and discusses the detection problem for single and multiple 
defective sensors. Section |IV] presents the simulation results. 

II. Centralized detection with probabilistic 
Group Testing 

We first review the centralized detection of sensor failures 
with methods based on GT. This framework is the ground 
for the novel distributed GT algorithm discussed in the next 
section. Detection is the identification of a subset of defective 
sensors whose measurements deviate significantly from those 
of the sensors in their vicinity. Based on the test construction, 
the methods for detection are categorized into deterministic 
and probabilistic algorithms. General centralized deterministic 
GT methods assign each sensor to the set of tests prior to per- 
forming them, where the tests are designed to assure detection. 
This approach however is not feasible for networks with large 
number of sensors. To alleviate this, probabilistic GT has been 
proposed in fT9l . We focus on test design methods that do not 
use the knowledge of realized test outcomes for novel test 
designs, since they are more appropriate in realistic settings. 

Hereafter, we adopt the following notation: matrices and 
vectors are represented with boldface capital letters (M, m) 
and their elements are given with lowercase letters {Mij,mi). 
Calligraphic letters are used to denote sets (Q), while | • | 
represents the number of elements in a set. The i-th column 
and the i-th row of M are represented with M:_i and :, 
respectively. 

GT aims at detecting defective items in the set based on the 
outcome of binary tests. Nonzero entries of a S'-dimensional 
binary vector f G Ff indicate the defective sensors. F2 is 
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a finite field of size two and f is a K-sparse signal, where 
K S. The tests preformed on sensor measurements are 
represented with a Bx S dimensional matrix W. The nonzero 
entries of ^ G Ff refer to the sensors that participate in the 
i-th test. The boolean matrix multiplication operator is denoted 
with ®. Then, the binary tests results are denoted with the test 
outcome vector g e : 



g = W®f. 



(1) 



The design of the matrix W is crucial for reducing the 
number of required tests for the detection of defective sensors. 
This design resembles the design of generator matrices of 
LDPC codes 1201 . In the Tanner graph representation of LDPC 
codes, the LDPC encoded symbols are partitioned in check and 
variable nodes, where the check nodes are used to detect errors 
introduced during transmission of LDPC encoded symbols. 
Motivated by this similarity, the test matrix W is constructed 
as mi: 

1, with probability q, 
0, otherwise. 



(2) 



The sensor participation probability is denoted with q. Such a 
design for the test matrix assures that with high probabiUty, 
any test matrix column is not a subset of any union of up to 
K columns {disjunctness property). In other words, a matrix 
W is called X-disjunct if no column W:.i of W lies in the 
sub-space formed by any set of K columns W: j with j ^ i. 
This property enables fast decoding with a distance decoder 
(i.e., Hamming distance). The distance decoder exploits the 
knowledge of the test outcome vector g and the test matrix or 
the seed of the pseudorandom generator that has been used for 
generating the random test matrix. Next, we discuss in more 
details the disjunctnesss property and the detection probability 
in centralized GT, since they represent the starting point of the 
decentralized detection method proposed in the next section. 

We first formally define the disjunctness property 1191 of 
test matrices that results in low-cost detection. This property 
assures that the union of any set of at most K different 
columns of W differs in at least e positions from any other 
column of W. 

Definition 1: Disjunctness property: A boolean matrix W 
with S columns W:,i, W:,2, . . . , W:_s is called {K^e)- 
disjunct if, for every subset T of its columns, with |r| < K: 

I supp{W.,^M y supp{W.,^j)) |> e, e {1, . . . , 5} 

(3) 

where supp{W ■4) denotes the nonzero elements (support) of 
the column W:_i and \ is the set difference operator. 

Disjunctness is an important property since it permits to 
analyze the detection probability. The connection between the 
structure of disjunct matrices and detection of defective items 
is given by the following proposition |fT9l . 

Proposition 1: If the test matrix W fulfills a [K, e)-disjunct 
property, the detection problem is resolved in the X-sparse 
vector f with error parameter e. 

The disjunct matrix parameter e represents the distance de- 
coder threshold for detection. The decoder accumulates the 
number of entries in a column of the (X, e)-disjunct test matrix 



that are different from the outcome vector g. The columns of 
W that achieve the lowest Hamming distance correspond to 
defective sensors. For any column W:,i of the test matrix W 
that is {K, e)-disjunct, the decoder verifies if: 



supp{W ■,^i)\supp[^) |< e, 



(4) 



where g = W CE) f is the vector of test outcomes. In other 
words, the decoder counts the number of positions in the 
column W: i for which the union of distinct columns differs 
from the set T in order to detect defective items. The columns 
of the vector f are inferred as nonzero iff the inequaUty Q 
holds. 

Finally, the detection performance can also be analyzed 
in noisy settings, when the test matrix satisfies disjunctness 
property. The noisy settings results from the alternation of the 
nonzero entries in W with probability 1 — p, as represented 
in Fig. (|2|i. 




Fig. 2. Representation of noise influence of binary symbols in the test 
message. Non-zero values in the test matrix are flipped with probability 1 — p. 



The following proposition provides the required number of 
measurements in centralized detection for successful decoding 
with the distance decoder in noisy settings |fT9l . 

Proposition 2: Let the test matrix W be {K, e)-disjunct. 
The distance decoder successfully detects the correct support 
with overwhelming probability for a iiT-sparse vector f in a 
noisy environment when the number of tests is equal to i? = 
0{K\ogiS)/p^). 

The insights provided by the above results are used in the 
analysis of the novel distributed GT algorithm proposed in 
the next section. 

III. Distributed detection method 
A. Sensor network message design and dissemination 

In this section, we propose a novel distributed failure de- 
tection algorithm and analyze its performance. The algorithm 
is based on a novel test design and message dissemination 
strategy in a distributed GT framework. The sensors iteratively 
create and disseminate messages in two-phases, denoted by 
t] and tjj. During the first phase tj, the sensors obtain 
messages that estimate the presence of defective sensors in 
their neighborhood. In the second phase tjj, the sensors 
linearly combine messages and exchange them employing 
a gossip mechanism. One round of our iterative distributed 
detection algorithm consists of these two phases. They are 
illustrated in Fig. [3] and described below in more details. 

The first phase tj in round t represents the message con- 
struction process illustrated in Fig. [3fa). L master sensors 
cluster the network into disjoint subsets Vi C V, I ~ 1, . . . , L. 
Clustering is used to bound the search space of decoder, 
as explained in the following subsections. Measurements of 
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(a) Phase t/: Message design. (b) Phase tu: Message dissemination. (c) Communication phases. 

Fig. 3. Illustration of the message design and dissemination through the sensor network, (a) Message formation based on local sensor measurements: Full 
and dashed arrows coiTespond to the steps of the message design, respectively. In the first step, the master sensor collects the sensor measurements from 
its neighbor sensors {si, . . . ,54} and forms the message W; In the second step, the message is propagated from the master sensor to its 

neighbor sensors, (b) Message dissemination based on a gossip algorithm with pull protocol, where the sensors request the messages from their neighbors 
chosen uniformly at random, (c) Rounds of communication in our iterative detection algorithm consist of the message design (t/) and the message dissemination 
(tjf) phases. 



neighbor sensors do not vary significantly when the sensors are 
not defective when the signal under observation is smooth over 
the sensor field. The master sensors locally gather the readings 
or measurements from sensors that participate in their test. 
Each sensor randomly participates in the test with probability 
q, as given in Eq. (|2]i. The master sensor estimates the presence 
of defective sensors within its neighborhood and then attributes 
a binary value /(s^) G f to each sensor in the neighborhood. 
The value f{si) — 1 denotes that the sensor Si is defective. 
Noise alternates non-zero bits with the probability I — p, as 
shown in Fig. (|2]i. The test outcome at master node I is finally 
computed as: 
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W, 



1, 
0, 



sensor(s) G K. 
otherwise. 



(5) 



where the binary matrix operator ® is composed by and 
© and stand respectively for the bitwise OR and the bitwise 
addition operators, where K, is the set of defective sensors. 
The message formed by a master sensor I during the phase tj 
consists of the outcome gi and the test participation identifier 
Wi_:. The message {gi{t~),Wi,;{t~)) is sent to the neighbor 
sensors, which concludes the phase tj. 

During the phase tjj, the messages created in the phase 
tj are disseminated within the network. The phase tu is 
illustrated in Fig. [3b). Every sensor i G {1, . . . , 5} requests 
the message formed at the previous round from its neighbor 
j, chosen uniformly at random, following a gossip mechanism 
with pull protocol. Next, each sensor j responds to the message 
request that it has received from sensor i by sending its 
message from the previous round. This process is performed 
only once per round. The sensor i further combines these 
messages as follows: 



g,{t)^g,{t )(Bg,{t-l), 

W,,(t) ^ W,,(t-)©W,-:(i-l), 



(6) 



where gj{t~l) denotes the sensor outcome value of the 
neighbor j at the previous round {t — 1). The vector .(t) 
represents the test indicator vector at the sensor i in round t. 
Since the messages are created probabilistically, the message 
combination in the different rounds assures that an innovative 



message reaches sensors at every round with high probability. 
A toy example of the dissemination phases is illustrated in Fig. 
|4] In this example the sensor S2 at round t pulls the message 
from the sensor si and constructs a new message according 
to Eq. (|6]l. 

{gi{t-l),W\t-l)) ^{g2{t),W\t)) 



e- 



m. mm ) 



(m. imm ) 



Fig. 4. The message formation at sensor S2 in round t. We assume that 
sensor S2 pulls sensor si to send its previous round values (round t — 1). We 
assume that the sensor S3 is defective f = [0010. . .]. The outcome value 
and the test identifier vector are formed by bitwise XOR. 



In a matrix form, the process of message formation and 
transmission in B rounds of our algorithm at any sensor in 
the network is represented as: 



g = W®f, 



(7) 



where the sensor identifier matrix W — 
[Wi_:(i); . . . ; WB,:it)] is of size B x S. The latter equation 
resembles to the outcome computation in the centralized 
GT case. However, in the distributed GT the tests represent 
linear combinations of test vectors that build disjunct matrix 
with high probability, as given in Eq. To make a clear 
distinction between test matrices in proposed and centralized 
setup, we assume that an oracle has a direct access to the 
master nodes. Let Ci^-, denote the concatenation vector of test 
realizations at master nodes collected by an oracle in the phase 
tj of the round t = i. The matrix C = [Ci.:; C2.: . . . Cb,-.] 
then represents the test matrix over B collection rounds. 
Observe that the matrix C is by construction disjunct, while 
W is built on the boolean addition of rows of C as in Eq. 
(|6]l. The values in W thus depend on the random message 
propagation path, which is obviously not the case in the 
centralized GT algorithm. Note that, for an arbitrary network, 
the number of network rounds required for collecting a 
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particular number of linearly independent tests varies and 
depends on the network topology, the number of master 
nodes L and the test participation probability q. 

Once every sensor has gathered enough test messages, it 
independently solves the failure detection problem finding the 
binary vector f that satisfies the tests in Eq. (|7]i. This solution 
f indicates the defective sensors. This process is analyzed in 
more details below. 

B. Detection of one defective sensor in the network 

We first analyze the case of a single defective sensor (case 
K ~ 1) m the network and study the detection probability 
of our distributed algorithm. To recall, the distance decoder 
used for detection computes the Hamming distance between 
two vectors a and b. The element-wise distance is given by: 

To avoid the false alarms, the decoder threshold e is set to the 
value that is higher than the expected number of noise-induced 
bit flips per columns in the disjunct matrix C |fT9l : 

e = {l + 5){l-p)qB. (9) 

where 5 > is a small constant and B is the number of rows 
in C. Columns of C have in average qB non-zero elements. 
Every non-zero matrix element is flipped with probability (1 — 
p) and the expected number of flips per column is: 

^i = {l-p)qB. (10) 

Recall that the matrix C is by construction a disjunct matrix. 
Proposition [T] states that the detection problem is resolved for 
tests that form a disjunct test matrix. However, the messages 
available at sensors in the network form a test matrix that is 
obtained by linear combinations of disjunct matrix rows and 
not disjunct matrix rows itself. Nevertheless, we show below 
that the distance decoder detects defective sensor with high 
probability under certain conditions. 

The formal propositions for detection with high probability 
are given below. First we show that the proposed algorithm 
in the network with a single master node designs a {K, e)- 
disjunct matrix C during the phase </. Next we show that 
in a single cluster network linear combinations of rows in C 
preserve distances between the test outcome and the column of 
the defective sensor in the test matrix. We then build on these 
two propositions to analyze the number of messages needed 
for the distributed detection of a single defective sensor, which 
is given in Proposition |7] 

We first show that for a network with a single master node 
(L = 1) and probabilistic message design in the phase tj, a 
{K, e)-disjunct matrix C is built with high probability. This 
case boils down to the centralized collection of data described 
in lfT9l and the defective sensor can be detected by a distance 
decoder as shown in Proposition [T] 

Proposition 3: For a single-cluster network, the message 
design over the phase ti of our proposed method builds a 
{K, e)-disjunct matrix C with high probability for an arbitrary 
K and e defined as in Eq. (|9]l. 



Proof: We show that the probability that the number of 
rows with a good disjunctness property G of C is smaller 
than e and we follow the development proposed in [19]. The 
sensor participation probabiUty q in a test defined as in Eq. 
(|2]i. A row of the matrix C^.: is considered to have a good 
disjunctness property if a single symbol "1" occurs, while the 
rest K — 1 values are equal to zero. The probability of such 
an event is equal to /ii = q{l — q)^~^. The random variable 
that marks the total number of rows with such a property is 
denoted with G. The distribution of G is binomial with a mean 
value IJ.2 = MiS. We show that the probability of having less 
than e rows with good disjunctness property is small under 
the assumption that e < /_t2. We limit this probability by a 
Chernoff bound as: 

P{G < e) <e'^~^ =e ^ 2ii^,)K-i _ (jj) 

Knowing that 2 < e < 3 and that constant a > 0, we get 
2-a > g-a > Since 3"" < (1 + < 2"" holds, 

7 = ''^"''^''2(1-9)^^^1^^'^'^^'' bounded. For the parameter 
choice in ||T9l {S, a) = (|, |), the value 7 = 0{p'^). Therefore 
this probability can be designed to be arbitrary small: 

P(G<e)<e-^^/^' = e-°(^f'/^). (12) 

■ 

Then we show that linear combinations of rows of {K, e)- 
disjunct matrices C in a network with a single master node 
preserve the Hamming distance only between the column of 
matrix W: fe that corresponds to the defective sensor Sk and 
the outcome vector g. 

Proposition 4: Let C be the {K, e)-disjunct matrix created 
over consecutive B rounds in a single-cluster network during 
the phase tj. Linear combinations of messages generated 
during the phase tjj, performed as in Eq. (|6]l, preserve the 
Hamming distance between the column of obtained matrix 
W:.fc that corresponds to the defective sensor Sk and the 
outcome vector g. 

Proof: We first analyze the case that leads to a decoding 
failure for {K, e)-disjunct matrices following a development 
similar to fT9l. We prove further that linear combinations of 
rows in such matrices preserve vector distances between the 
outcome vector and the column of W that corresponds to the 
defective sensor 

A decoding failure with a distance decoder occurs in a 
{K, e)-disjunct matrix when the number of flips of column 
elements of C is higher than e. The probability of occurrence 
of a single flip is equal to fi^ = q{l — p). Let F denotes 
the number of flips in the columns of the matrix. Hence, the 
expected number of flips per column is given in Eq. ( fTOl l. 
We want to compute the lower bounds for the event that more 
than (l + (5)/i flips occurred in the column of the matrix, where 
(5 > 0. Applying the Markov inequality: 

P(f >(l+J)^)<inf /^g^^'^ ' (13) 

and plugging the probabiHty of the single flip event: 

, . ^ f 1, with probability {I - p)q, 
^ \ 0, with probability 1 - (1 - ^ ' 
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to the expectation term of the previous equation leads to: 

p(F < ,„,nsi(i-p).^+(i-(i-rt.)i 

= ^^^^^^^ THTT, -■ (15) 

If we set (1 —p)q{e'^ — 1) = a; and plug the inequality 1 + a; < 
e^, we obtain: 



P{F > (1 + S)n) < inf 



<i>0 e''(i+'^')^ 

p(l-p)gmi(e''-l) 

inf 



g(i(l+<5)At 
eA'(e''-l) 



For the constant d = log {I 

P(i^>(l + <5)M)<( 



(5), we finally obtain: 



(16) 



„M<5-A'(l+'5)log(l+5) 



Observing that log{l + 5) > the Eq. becomes: 



(17) 



P{F> {l + 5)ii) <e— 



(18) 



The outcome value g depends on the presence of a defective 
sensor Sk in the test. We prove here that the distance between 
g and the A;-th column W:^/£ does not increase more than e 
during tu, while this is not true for the rest of the columns. 
When sensor j sends its message to sensor i during the round 
t, we have: 

dist{g,{t),W,M{t)j 

= dist{g,{t-) ® gAt - ^),W^,k{t-) ®Wj,h{t - l)j 

= dist{g,{t-),W^,k{t~)j ®dist{gj(t-l),W,,kit~l)^,{'^9) 

where the first equality results from Eq. The second 
equality follows directly from the fact that the values of g(i^) 
and the columns W:,fc(i^) are identical for the defective 
sensor due to Eq. (|7|i. Since these two columns can initially 
differ at e positions due to noise flips, the overall distance 
between the vectors g(t^) and W: /j(<^) is at maximum e 
given in Eq.®. ■ 
We consider now networks with L master sensors and an 
hypothetical centralized data collection. We assume that L 
master nodes cluster the sensor network in disjoint subsets, 
where every sensor belongs to exactly one cluster The master 
nodes perform message design over the rounds t/ as proposed 
by our algorithm. We show now that the tests gathered from 
the L different clusters build a disjunct matrix, where each 
cluster relates a (A', e)-disjunct matrix. 

Proposition 5: The diagonal matrix C ^ 
diag{Ci, . . . ,Cl) obtained from (/v , e)-disjunct matrices 
C = {Cjjf^j is at least (A', e)-disjunct. 

Proof: Proof follows directly from the Definition [T] and 
the disjunctness property of the matrices in C. ■ 
We consider now the gathering of messages that are linearly 
combined over successive rounds of our detection algorithm. 
Uniform gathering of linearly combined messages at L clusters 
by a hypothetical centralized decoder results in detection of 



the defective sensor with high probability when the number 
of received messages is sufficient. 

Proposition 6: When the {K, ei)-disjunct matrices C ~ 
{Ci}f^i are linearly combined as in Eq. (|6]l, where e = 
SiLi ™d Q = Si^i resulting test matrix permits 

detection by a distance decoder with high probability as 
long as it contains in total B > 0{K log{S)/p^) messages 
collected from clusters chosen at random. 

Proof: We first show that a diagonal matrix constructed 
from (K, ei)-disjunct matrices of the set C is (A', e)-disjunct. 
Next, we recall the Proposition |4] and finally, we show that 
the B measurements assure a good disjunct property of 
cluster matrices. Let the number of rows for all matrices be 
B = 0{K\og{S)/p^). The parameters e and are defined in 
Eq. and e = J2f=i = + S){1 - p)Bq, the diagonal 
matrix of (AT, e^) matrices is (AT, e) disjunct. The next part of 
the proof follows from the Proposition |4] which states that a 
matrix whose rows are formed by linear combinations of rows 
of (AT, e)-disjunct matrix permits detection with a distance de- 
coder Finally, we need to prove that for a given C; the disjunct 
property holds given that at least B messages are available. 
For this purpose, we follow a development similar to |fT9l and 
consider maximum number of sensors in clusters is S„iax = S. 
The probability bound given in Proposition [3] should hold for 
all possible choices of a fixed set of T out of S columns: 
\JtP{G < e) < Se'^''^. This probabiHty can be arbitrary 
smaU, e.g., in case B > -^^^^ = 0{K log S/p^). Further on, 
the condition in Eq. (fTsT l. which gives the probability bound 
that the number of flips in any AT out of T columns exceeds 
a threshold value e is also bounded. It reads 



UxP(i^ > {l + S)^l) < Ke- 



'^{l-p)qKlog{S) 



where the last equality is obtained by using Eq. ( fTOb . This 
probability is small for the sufficiently large value of i? = 
0{Klog{S)/p^). m 

We now analyze the proposed distributed algorithm and 
consider the detection requirements for every sensor in the 
network. We show that the test messages collected by the 
sensors during the transmission rounds enable failure detection 
by the distance decoder with high probability if the number 
of messages is sufficient, where the decoder operations are 
performed locally at sensors. 

Proposition 7: We assume that L master sensors partition 
the sensor network in disjunct parts. Test realizations within a 
cluster form test vectors. Over the rounds, these vectors create 
(A', e)-disjunct matrices C = {C,;}|L]^: 

1, with probability a^, 
0, otherwise. 



(20) 



where q = J2i=i1i- Messages (gj,Wi_:) arrive at all the 
sensors in the network in our proposed algorithm, as described 
in the previous section. If the above assumptions hold and 
if the number of linearly independent messages received per 
cluster at every sensor in the network is at least B/L, where 
B>0{K \og{S) /p^), the probability that sensors fail to detect 
the defective sensor by the distance decoder tends to zero as 
S ^ CO. 
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Proof: The message collection method does not influence 
the decoder performance, since the number of per-cluster 
measurements is sufficient for decoding with high probability. 
Therefore, the proof follows from the proof of Proposition |6] 



C. Detection of multiple defective sensors in the network 

We analyze now the distributed detection of multiple defec- 
tive sensors, where the number of defective sensors is much 
smaller than the total number of sensors. We propose here 
to slightly modify our distributed algorithm and to limit the 
decoder search space to be able to apply the Hamming distance 
decoder. The protocol modification and the adaptation of the 
distance decoder are described below. We assume that sensors 
completely differentiate between sensors in the network that 
belong to particular clusters and that at most one defective 
sensor is located in a given cluster This knowledge limits the 
size of the decoder search space. 

The proposed protocol is first modified as follows to 
deal with multiple defective sensors. A decoder error occurs 
when two or more messages with positive test outcomes are 
combined together during the phase t//, since the distance 
preserving property defined in Eq. ( fT9] l is not guaranteed in 
this case. Since the number of defective sensors is very small 
compared to the total number of sensors, this event however 
occurs rarely. We explain the protocol modification with a 
simple example. Let the sensor i pull the message from the 
sensor j, where both sensor test outcomes have nonzero values. 
Instead of combining the messages as in Eq. we simply 
buffer the new message of sensor i and consider the message 
from sensor j at previous round as the final outcome of the 
phase t: 

W^,it)=Wj,{t-l). (21) 

At the first subsequent round r > t + 1 of our distributed 
algorithm where both messages .gi(r) and gj{T — 1) have non- 
zero values as test outcomes, gi (r) is replaced by the message 
buffered in node i. The rest of the protocol remains unchanged. 

Then the decoding proceeds in two main steps. First, the 
appropriate unions of test matrix columns are created to form 
a search set space and second, the Hamming distance between 
the test outcome vector and the vectors of the search set 
are computed. The minimum Hamming distance indicate the 
solution of the detection problem. The outcomes g = [go gi]"^ 
collected at some sensor are divided into two sets, i.e., the 
negative and positive outcome vectors go and gi, respectively. 
Subsequently, the rows of the test matrix W form two sub- 
matrices Wq and Wi and Eq. (Q is rewritten as: 








■ Wo 







■ fo ■ 


1 







Wi 




fi 



(22) 



We eliminate non-defective sensors from Wi using the knowl- 
edge from Wo and obtain W^^. The columns of interest are 
those columns of W^^ which contain at least one non-zero 
value. These columns are classified in sets TL, whose size 
depends on the complete or partial sensor knowledge about 



cluster affiliation of other sensors in the network. Columns 
belonging to the same cluster are grouped together in a set 
Hi, where i £ {1, . . .,L} and L is the number of clusters. 
The search space U consists of vectors that are obtained from 
unions of up to K columns, where each column is picked 
from a different set Hi- We choose up to K columns, since 
the number of defective elements can be smaller than K by the 
problem definition, while the selection of at most one column 
from a particular Hi comes from the assumption that at most 
one defective sensor exists in each cluster For instance, let the 
number of defective sensors and clusters be [K, L) = (2, 2). 
Let Til contain hi and H2 contain /12 columns. Then the 
search space size has in total /ii/i2 + /ii + /12 elements, where 
hih2 = ('\^) • {^■{') denotes the number of unions of if = 2 
columns and single column subsets are chosen in /11+/12 ways. 
Distance decoding is performed between gi and elements of 
the set U, starting from the vectors that are created as unions 
of K columns towards the smaller number of column unions. 
If no solution exists for a particular value of K, we perform 
the decoding for vectors built from K — 1 column unions of 
Hi- If no unique solution is found, we encounter a decoding 
failure. 

Now that the decoder has been described, we analyze in 
details the number of required messages that are necessary for 
detection of multiple defective sensors with high probability. 

Proposition 8: Under the assumption that at most one de- 
fective sensor is present in the cluster, that the number of 
available linearly independent messages at all sensors is at 
least B/L per cluster, where B > 0{K \og{S) /p^) and that 
sensors know membership identifiers of all the clusters in the 
network, the distance decoder detects defective sensors at all 
sensors in the network with high probability. 

Proof: To recall, the transmission protocol ensures that 
the assumptions imposed by Proposition [7] hold for one de- 
fective sensor. Then, due to the assumption that at most one 
defective sensor is present in one cluster and that there is at 
most one defective sensor active in the test, we can form the 
set of solutions for the multiple defective case, which has a 
unique solution. Distance decoder between the outcome vector 
and a limited set of vectors that form a full search space 
can therefore find the appropriate solution. In other words, 
this procedure is identical to per-cluster decoding, where each 
cluster has at most one defective element, so the Proposition 
|7] can be applied. ■ 

Proposition 9: Under the assumption that one defective 
sensor at most is present in the cluster, that the number 
of available linearly independent messages at all sensors 
in the network is at least B/L per cluster, where B > 
0{K log(5) /p^) and sensors know the partial set of identifiers 
of the clusters in the network, the distance decoder detects 
defective sensors at all sensors in the network with high 
probability. 

Proof The search space U created in this case is larger 
but it contains the solution. Now the proof is identical to that 
in the previous proposition. ■ 
Finally, we show that the assumption of at most one 
defective sensor occurrence per cluster is reasonable. We here 
bound the probability that at least two defective sensors occur 
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within any cluster. An erroneous message is generated in a 
cluster that contains more than one defective sensor when only 
a fraction of defective sensors participate in the test actively 
and we denote the probability of such an event with P{E). If 
defective sensors participate in the test, the distance within the 
column that signifies these vectors and the outcome result does 
not change. The same occurs if none of the defective sensors 
participate in a test. Due to the protocol modification, only 
one cluster may generate the erroneous message per round. In 
total we assume there are m g {2, . . . , isT}, m < n defective 
sensors and that clusters contain 71=^ sensors. Then, the 
probability of decoding error in one cluster Pci{E) is equal 
to: 

K K 

Pd{E) = P{n,q\m)P{m) - ^ P{n\m)P{q\m)P{m), 

m—2 m—2 

(23) 

due to independence of parameters n and q. P{m) represents 
the probability that some cluster contains m defective sensors, 
P{n\m) = (Jj'J is a probability of choosing m defective 
sensors within a cluster with n sensors and P{q\m) denotes 
the conditional probability of the error occurrence in a cluster 
with m defective sensors and test participation probability q. 
We assume that m takes a value from the set {2,..., A'} 
with uniform distribution, so P{rn) = j^^- Next, P{q\m) = 
1 — g'" — (1 — q)"^ (Appendix |A|. Total error probability for 
L clusters is bounded by P{E) < L ■ Pci{E), so: 

K 



P{E) < L 



A' - 1 ^ 



(1-9)' 



(24) 



m—2 



We use the well known binomial coefficient inequality ( "J > 
(m)™ that holds for n,m > where m < n and 1 — — 
(1 - q)™ <l,qe {0, 1} to bound the value: 



1 - (J™ - (1 - qY 

n 



< 



(l-qY 



i-Y 



< 



(25) 



We rewrite ( — )™ by using a well known inequality as 
(^)™ = (1 < e"-"\ Plugging these expressions 

to the previous expression and performing simple calculations 
we finally obtain: 



P{E) < 



L 



1 



(26) 



A" - 1 e - 1 

For the network values (5, L, K) = (70, 5, 3) this probability 
is bounded with P{E) < 1.1 • 10"''. 

The distance decoder error probability due to our assump- 
tion that only one defective sensor is present in the network is 
small. In addition, the decoder threshold value can be updated 
to increase the robustness. We increase the value of threshold 
parameter as e = e + S^, where 6^ — P{E)E{gi) and E{gi) 
is the expected number of non-zero test outcomes. It is set to 
the total number of observed positive test outcomes. 

IV. Performance evaluation 

A. Setup 

In this section, we investigate the performance of our 
distributed detection method denoted as GP in various sce- 
nario We first examine the influence of the different network 



parameters in the rate of dissemination of messages. Next, we 
examine the decoding probability for both single and multiple 
defective sensor(s) detection. The number of system rounds 
required to collect the necessary number of messages for the 
accurate decoding varies with the topology. The simulations 
are performed for fully connected, fc-connected and irregular 
graphs. Finally, we discuss the number of required linearly 
independent measurements needed for successful detection 
and compare it with the theoretical one. 

We also analyze the performance of several alternative 
schemes, namely a Random Walk method that employs a 
Gossip mechanism with pull protocol (RWGP) and a classical 
Random Walk (RW) detection. A random walk determines the 
path of successive random dissemination message exchanges 
between neighbor sensors. In the RWGP method, the random 
walk is initiated at L sensors (equivalent to the master sensors 
in the GP method) and terminates after a pre-determined num- 
ber of rounds. The sensors create messages from the sensor 
measurements collected along the random walk path. These 
messages are transmitted with the gossip algorithm that uses 
a pull protocol. Note that, for identical choice of the sensors 
over rounds, RWGP and GP are identical. The RW method 
initiates the raw (uncompressed) measurements collection in L 
random sensors and completes it in a given number of rounds. 
Every sensor that lays along the random walk path stores the 
values of all sensors along the transmission path. When all the 
sensors receive all the data, the process terminates. 

The GT algorithm is also compared with a Store-and- 
Forward (SF) and a Greedy Store-and-Forward (GSF) method 
that employs pull protocol. Both algorithms disseminate raw 
sensor measurements. For the SF method, upon receiving a 
message request, a node responds by forwarding randomly 
chosen messages from the available set of messages. In GSF, 
each sensor randomly requests the innovative measurements in 
a greedy manner from its randomly chosen neighbor sensor 
This procedure involves additional message exchange among 
sensors in every round. 

We analyze the performance of these algorithms in fully 
connected, k-regular graphs and irregular networks. For irreg- 
ular sensor networks construction, we place sensors randomly 
in a unit square area. Sensors that lay within a certain radius 
can communicate and exchange messages directly. In each 
case, we build 10 different network realizations and for each 
such realization we perform 100 independent simulations. The 
results are averaged over all simulations. 

B. Influence of the master node selection process 

First, we study the influence of networks' capability to 
generate innovative messages on the decoder performance. We 
consider two different methods for selecting master sensors: 
random master sensor selection (RM) and deterministic master 
sensor (DM) selection. Fig.|5]illustrates the detection probabil- 
ity and the achieved average rank with respect to the number 
of message dissemination rounds, for fully connected graphs 
with 5 = 20 sensors and one (AT ~ 1) defective sensor We 
observe that the performance depends on L and a = qK for 
both RM and DM. These values should be selected properly 
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Fig. 5. Simulation results for fully connected graphs with 5 = 20 sensors, K = 1, where RM and DM denote the random and deterministic selection mode 
of master sensors, respectively. Top row: Probability of defective sensor detection. Bottom row: Average rank of messages received per sensor. Column (a): 
fixed values of the master sensors (L = 5). Column (b): fixed values of the sensor participation constant (a = qK = 0.7). 



in order to maximize the information diversity in the network. 
Specifically, we observe that RM achieves the maximum 
message diversity for a = 1 (maximum value) since the 
diversity of messages in this case is maximized by construction 
in Fig. |5] We can also note that the number of clusters does 
not affect significantly the detection performance of RM. On 
the contrary, for DM both parameters L and a are important. 
Small values of a guarantee high message diversity. This is 
due to the fact that DM requires more rounds to receive enough 
messages for detection. In the following, we focus on RM 
selection where possible (that is, for K = 1), as it provides 
higher probability of creating innovative messages. 

C. Detection performance 

We first consider the case of a single defective sensor 
{K = 1). The detection probability and the average rank 
evolution over rounds are examined for fully connected (FG) 
and A;-connected regular networks (RG) with sensors degree 
k G {6, 16}. For all cases, the network consists of S* = 20 
sensors. From Fig.|6]we see that networks with higher number 
of connections achieve faster dissemination of innovative mes- 
sages. We also note that high connectivity value k is beneficial, 
but it cannot drive by itself the performance of our detection 
scheme. It should be combined with appropriate choice of 
network parameters, as discussed earlier For example, RM 
master sensor selection for k = 16 achieves better detection 



performance, compared to that of fully connected graphs. 

In Fig. |7] we illustrate the detection probability for random 
graphs (100 simulations per different graph) with S = 20, 
K ^ 1 defective sensor, L = 5 random clusters and minimum 
sensors' degree fc > 3. We observe that random graphs require 
more rounds in average for successful detection, as expected. 
Also, we observe that the detection performance decreases 
because of the limited message diversity (smaller probability 
of receiving innovative messages) and the low connectivity. 
Similarly, Fig. [8] presents results for larger networks which 
are in accordance with the above. 

We then consider the case of multiple defective sensors. In 
Figs. |9] and [To] we present results for the cases with two defec- 
tive sensors {K = 2) in networks of 20 sensors. The results 
are given in terms of the average detection probability over 
dissemination rounds, for both fully and irregularly connected 
graphs. The master sensors are selected deterministically (DM) 
due to decoder design for multiple defective sensors identifica- 
tion. Note that this example violates the condition K <^ S and 
the performance of the detection algorithm is pretty poor. In 
addition, results for 5 = 70 and K ~ 2aie depicted in Figs.fTTI 
and [12] We focus on the evolution of the decoding probability 
and the average number of messages collected over rounds. 
From the evaluation it is clear that the detection performance 
is reasonable when the selected parameters value {L, a) favor 
diverse message generation. 
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Fig. 6. Simulation results for fully connected (FG), k = 16-regular connected (RG, k = 16) and k = 6-connected graphs (RG, fc = 6) with S = 20 
sensors, K = 1 and a random selection (RM) of L = 5 master sensors: (a) Probability of defective sensor detection; (b) Average rank of messages received 
per sensor. 




Fig. 7. Probability of defective sensor detection; Simulation results for iiTegular graphs (k > 3) and random selection (RM) of S = 20 sensors, K = 1. (a) 
L = 5 master sensors; (b) sensor participation constant a = qK = 0.7. 



TABLE I 

The theoretical measurement requirements for networks 
with s sensors. 





S 

K = 1 


=20 

K = 2 


S= 

K = 1 


70 

K = 2 


p G (0.9 - 1) 


130 


(115-244) 


(174-217) 


(125-284) 



In |fT9l . a centralized system has been proposed, which can 
be considered as dual to fully connected networks with cen- 
tralized tests (single master sensor that covers all the network). 
For comparison reasons, we compute the required number of 
measurements for networks with: {S = 20, K € {l,2},p G 
(0.9 ~ l),q e (0.15 - 0.3), p/i = 0.01, p/2 = 0.01) and 
(S'==70,/i e {l,2},pe (0.9-l),ge (0.15 - 0.3), p/i = 
0.01, p/2 = 0.01). The results are reported in Table U We 
observe that the worst case analysis leads to higher number 
of dissemination rounds than the real ones. However, these 
values decrease relatively to the growth of number of sensors 
in the network. Simulations show that in practice the required 
measurements are significantly fewer 

Detection probability comparison of the proposed method 
with several detection methods are illustrated in Figs. [13] 



and [141 for 20 and 70 sensors respectively. The proposed 
scheme outperforms all other methods. Note that the number 
of necessary rounds in RWGP scheme is large compared to 
the other schemes, while RW needs higher communication 
overhead for dissemination due to the transmission of raw 
sensor measurements. Average rank values over the network 
rounds are illustrated in Fig. [15] We observe that for the 
fixed detection probability p = 0.9 for the network with 
S G {20, 70} sensors the average number of system rounds 
required for the proposed method is approximately {13, 17} 
and {15,20}, respectively. The number of system rounds 
required by the other algorithms to reach the same probability 
of performance is higher, especially for the network with 70 
sensors. 

D. Communication overhead 

For the sake of completeness, we analyze the communica- 
tion costs of the proposed gossiping protocol and compare 
it with all other schemes under comparison. Let Rd and 
Id denote the number of bits needed for raw measurements 
transmission and sensor identifier, respectively. Recall that the 
tuple {S, L, Ln,n,T) stands for the number of sensors in the 
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Fig. 8. Probability of defective sensor detection; Simulation results for iiregular graphs (fc > 3) and random selection (RM) of S = 70 sensors, K = 1. (a) 
L = 5 master sensors; (b) sensor participation constant a = qK = 0.7. 




network, the number of master sensors (clusters), the number 
of neighbors that each master is connected with, the average 
number of sensors per cluster {n = S/ L) and the total number 
of transmission rounds. 

During the first phase of GP, the master sensors receive raw 
measurements from their neighbors. Thus, Ln-Rd bits are used 
for communicating these values. Further, the master sensors 
create binary messages and send them to their neighbors. 
Every neighbor requires knowledge about the identifier of 
sensors that participate in a test, thus the cost is I^ - \q{L+Ln)'\ 
bits, plus an additional bit in each message for sending 
the outcome result. Hence, the overall bit consumption is 
LnRd + Ln{Id\q{L + + 1). In the message exchange 

phase 5(1 + S) bits are required, from which 5 + 1 bits are 
reserved for the test outcome and the test matrix row W. 
Note that this analysis includes the full vector size and it can 
be further compressed. The overall number of transmitted bits 
over T rounds is given by: 

n^P = T [L„{i?d + Id \q{L + L„)l + 1} + 5(1 + 5)] . 

(27) 

We compared the communication costs of GP with the 
one of RWGP that takes place also in two phases. The first 



phase represents the random walk message collection, while 
the second is equivalent to the GP algorithm. Note that in the 
special case when RWGP and GP collect exactly the same 
data, they have identical decoding performance. However, 
if RWGP visits some sensors several times (more probable 
in irregular networks with a smaller connectivity degree), it 
performs worse than GP. In typical simulations, a random walk 
of RWGP terminates after n*^'' transmission round, where n is 
the number of elements per cluster in GP. RWGP transmits 
raw measurements, which results in Rd + IRd + • ■ • + riRd = 
{i+n)Rd i^jjg Therefore, the communication cost for RWGP 
is given by: 



'■RWGP 



{n + l)Rd 



L + S{l + S) 



(28) 



The bit transmission requirements for the RW algorithm 
is equivalent to that of the first step of RWGP, since it 
transmits also raw data. The detection is performed at nodes by 
comparison of known sensor values at that moment, without 
message design step. The number of transmitted bits is equal 
to: ripyy = T L. Recall that for transmission of a 

message to all the nodes in a fully connected graph, one 
requires log 5 transmissions. Therefore, the SF algorithm 
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Fig. 11. Simulation results for fully connected (FG) and iiTcgular graphs (IG), d > 3 with 5 = 70 sensors, K = 2 and deterministic selection (DM) of 
L = 10 master sensors: (a) Probability of defective sensor detection (b) Average rank value. 



requires in total n'gp ~ rRd log S bits. 

The comparison between the proposed method and all 
other schemes regarding the bits spent for communication is 
illustrated in Fig.[T6]for a fully connected graph. Note that the 
proposed algorithm in this setup requires only t = 15 rounds 
for efficient detection (Fig. |5]l, but it consumes approximately 
three times more communication overhead compared to that 
of RWGP algorithm. However, due to the specific collection 
approach (hops), the duration of one transmission round of 
RWGP lasts ten times longer than that of the proposed algo- 
rithm. From the figure we can observe that the RW algorithm 
has very small communication overhead. However, it requires 
significantly higher number of rounds (5* logs' ~ 130 rounds) 
compared to the detection time of the proposed GP algorithm. 
Overall, the proposed GP scheme is able to compete with the 
other schemes in terms of bits used untill detection. 

V. Conclusion 

In this work, we have addressed the problem of distributed 
failure detection in sensor networks. We have proposed a 
novel distributed algorithm that is able to detect a small 
number of defective sensors in a networks. We have designed 



a probabilistic message propagation algorithm that allows the 
use of a simple and efficient distance decoder at sensors. The 
transmitted messages are formed from local sensor observa- 
tions and they are communicated using a gossip algorithm. 
We have derived for the worst case scenario the lower bound 
on the required number of linearly independent messages that 
sensors need to collect per cluster to ensure detection of 
one defective sensor with high probability. We have shown 
experimentally that this number is quite smaller in practice, 
even for the small size networks, which confirms the validity 
of the theoretical bound. The experimental results have shown 
that the proposed method outperforms other detection schemes 
in terms of successful detection probability. The convergence 
rate is very fast, which largely compensates for the higher 
communication overhead. 
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Appendix 

A. Model for probability P[q\m) 

P{q\m) models the probability of event that multiple de- 
fective sensors are present in the same cluster but only a 
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Fig. 14. Compai'ison in terms of detection performance for networks with S = 70 sensors and L = 5 master sensors. Abbrevations: GP: Proposed method, 
RWGP: Random Walk rounds with the gossip algorithm with pull protocol dissemination, RW: Random Walk in the network initiated at L sensors, (a) fully 
connected sensor network; (b) irregular sensor network. 




Fig. 15. Average rank value for irregular sensor networks with L = 5 master sensors: (a) S = 20 sensors (b) S = 70 sensors. Abbrevations: GP: Proposed 
method, RWGP: Random Walk rounds with the gossip algorithm with pull protocol dissemination, RW: Random Walk in the network initiated at L sensors, 
SF: pull store-and-forward algorithm with a random choice of transmission message available at sensor, GSF: pull store-and-forward algorithm with a greedy 
choice of a transmission message available at sensor. 



subset of defective sensors participates in the test. This event 
introduces errors while detection of defective sensors. Recall 
that sensors participate in the test with the probabihty q. For m 
defective sensors possible message realizations are given with 
elements of the polynomial + (1 — q))"^- This polynomial 
represents the the binomial expansion of the form {x + y)'", 
with X = q and y = (1 — g). Polynomial expansion is equal 
to {x + y)™ = cqx™ + cixy"^^^ + • • • + c„ij/" and the 
coefficients Ci = (™) represent the numbers of i-th row of 
Pascal's triangle. Messages that do not cause decoding error 
are the messages of all zeros and of all ones. These messages 
occur with probabilities and (1 — g)™, respectively and they 
have coefficients equal to 1. Note that (g+(l — g))'" = 1™ = 1 
and that probability of error event is therefore equal to: 

1 - o" - fl - oV" 

P{,\m) = i =l-q--{l- qr. (29) 



15 




(a) (b) 

Fig. 16. (a)Comparison of the communication overhead for several algorithms, for the following parameter values: {S, L, Ln,a, Rd, Id,''') = 
(70, 5, 50, 0.7, 7, 7, 80). Graph is fully connected. Abbreviations: GP: Proposed method, RWGP: Random Walk rounds with gossip algorithm and pull 
protocol dissemination, RW: Random Walk in the network initiated at L sensors, (b) Comparison of detection vs. number of rounds of the distributed 
detection scheme. 



